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Abstract 

We study a gossip protocol called forwarding without repeating (fwr). The objective is to 
spread multiple rumors over a graph as efficiently as possible, fwr accomplishes this by having 
nodes record which messages they have forwarded to each neighbor, so that each message is 
forwarded at most once to each neighbor. We prove that fwr spreads a rumor over a strongly 
connected digraph, with high probability, in time which is within a constant factor of optimal 
for digraphs with bounded out-degree. Moreover, on digraphs with bounded out-degree and 
bounded number of rumors, the number of transmissions required by fwr is arbitrarily better 
than that of existing approaches. Specifically, fwr requires 0(n) messages on bounded-degree 
graphs with n nodes, whereas classical forwarding and an approach based on network coding 
both require w(n) messages. Our results are obtained using combinatorial and probabilistic 
arguments. Notably, they do not depend on expansion properties of the underlying graph, and 
consequently the message complexity of fwr is arbitrarily better than classical forwarding even 
on constant-degree expander graphs, as n — > oo. In resource-constrained applications, where 
each transmission consumes battery power and bandwidth, our results suggest that using a small 
amount of memory at each node leads to a significant savings. 



1 Introduction 



We consider the now widely-studied problem of spreading rumors over a graph. Initially a subset of 
nodes have a rumor and we are interested in how long it takes to spread these messages to all nodes 
in the graph. In this paper we are particularly interested in a variant of the classical push-based 
forwarding algorithm. In the variant, which we refer to as forwarding without repeating, a node 
already holding one rumor forwards it to a neighbor only one time. Thus, each node must record 
which rumors it has sent to which neighbors. In bounded degree networks, if the number of rumors 
is bounded, we show that the number of transmissions required for forwarding without repeating to 
spread the rumors to all nodes is infinitely smaller than that of standard (push-based) forwarding. 
Moreover, the delay of forwarding without repeating is within a constant factor of optimal for any 
strongly connected graph. 

Our motivation comes from the variety of resource-constrained applications, where reducing the 
number of transmissions is important for efficiency of the system. These include efficient broadcast 
of code updates in networks of embedded wireless sensing and/or control devices [12\ [T3] and 
content dissemination in mobile social networks |9j. In these applications, devices have limited 
onboard energy resources and limited bandwidth. Reducing the number of transmissions can both 
free up communication resources and conserve energy resources. The forwarding without repeating 
algorithm considered here requires the use of additional memory at each node. However, in networks 
with bounded degree and where the goal is to spread a bounded number of rumors, we argue that 
using additional memory resources at each node may be beneficial if it leads to considerable savings 
in the number of transmissions. 

The literature on rumor spreading is vast. Here we briefly review the developments which are 
most relevant to our discussion. The seminal work of Frieze and Grimmet [8] introduces the problem 
of rumor spreading (a.k.a. gossip) and studies push-based gossip protocols on complete graphs. De- 
mers et al. [6] consider the use of gossip algorithms for lazy updating of distributed databases. They 
characterize the performance of push-, pull-, and push-pull algorithms, again on complete graphs. 
Chierichetti, Lattanzi, and Panconesi [3] provide the best-known upper bound on the number of 
iterations required for synchronous gossip (where each node communicates with one neighbor at 
each iteration) to spread a rumor over the entire network. They show that in a network of n nodes 

with conductance cfi, the push-pull strategy reaches every node within 0( log t — • logn) rounds, 
with high probability. In addition, they show that in graphs which satisfy a regularity condition 
(all nodes have the same degree, to within a multiplicative factor), the push and pull strategies 
satisfy similar bounds on their own. Relatively little work has focused on bounding the number 
of transmissions required to spread rumors (with the exceptions of [10] and [7], which characterize 
the message complexity for complete graphs and Erdos-Renyi random graphs, respectively). Most 
work in this direction is of the form: if a network of n nodes requires 0(r) rounds to spread a 
message, then the number of transmissions is no more than 0(n ■ r). 

In a slightly different direction, algebraic gossip protocols are proposed and studied in [5]. 
There the problem of spreading multiple messages is considered, and a network coding approach is 
adopted. Rumors are each finite-length bitstrings, and rather than gossiping on individual strings, 
the authors consider schemes where nodes form and transmit random linear combinations of the 
messages they have already received at each iteration. After every node has received sufficiently 
many of these coded strings, it can recover the individual rumors by solving a system of linear 
equations. In graphs with good expansion properties, this leads to a scheme where all nodes receive 
all messages in less time than it would take to spread all messages individually (without coding). 

In this paper we compare three approaches to rumor spreading: classical (push-based) forward- 
ing (for), the network coding approach (nca), and forwarding without repeating (fwr). We prove 
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the following results: 



• First, we analyze f wr in the synchronous time setting and we show that the number of rounds 
required for it to converge is within a constant factor of the diameter of the network, which 
is a trivial lower bound for rumor spreading on any graph. 

• Next, we analyze all three algorithms in the asynchronous time setting, where only a single pair 
of neighboring nodes exchange messages in each round. A trivial lower bound on the number 
of rounds required in this setting is n, the number of nodes in the network. For strongly 
connected (di)graphs with bounded out-degree, we show that f wr is within a constant factor 
of this lower bound. Moreover, when considering any family of graphs with bounded degree, 
we show that both for and nca require a number of rounds which is u:(n) (for n — > oo). Since 
the number of transmissions is proportional to the number of rounds in the asynchronous 
setting, this implies that fwr is arbitrarily more efficient than both for and nca in large 
networks in terms of message complexity. 

Our proofs use a combination of combinatorial and probabilistic arguments. Notably, our results 
do not involve assumptions about the conductance or other expansion-related properties of the 
underlying graph. Hence, these conclusions apply even to the family of constant-degree expander 
graphs [15]. Another notable feature of this work is that, in contrast to most previous work which 
assumes the graph is undirected, our results are obtained in the general setting of strongly connected 
directed graphs. 

2 Notation, Assumptions, and Problem Formulation 

Consider a connected directed graph Q = (V, £), where V is the finite set of vertices, |V| = n, and 
£ C {(u,v) : u,t) G V,u / d} is the set of directed edges. Each vertex represents an agent in a 
distributed system (e.g., terminals in a communication network). Edges represent communication 
media (e.g., wires in an Ethernet network, proximity in a wireless network). We say that a vertex 
v' is a successor of the vertex v if (v, v') G £. We also say that v is a predecessor of v' . A neighbor 
of v is a vertex that is either its predecessor or its successor. 

We impose the gossip transmitter constraint, which implies that at any given time, any vertex in 
the network can only send a bitstring to at most one of its successors (see also the definition in [14J). 
Conversely, a vertex may receive multiple bitstrings from different predecessors at the same time. 
This assumption is reasonable considering a wired network, for instance, where multiple incoming 
messages can be buffered. 

The out-degree d out (v) (respectively, in-degree d m (v)) of a vertex v is the number of successors 
(respectively, predecessors) it has, namely, 

d out {v) = \{v' : (v,v') e £}\ and d in (v) = \{v' : (v',v) G £}\ . 

The out-degree and in-degree of the graph Q are the maximum out-degree and the maximum 
in-degree, respectively, taken over all vertices in Q: 

d ou \Q) = maxd out (v) and d in (G) = max<F». 
i>gv vev 

A path in Q is a sequence of vertices Vq, vi, . . . ,V£ such that for all i = 0, 1, • • • ,£ — 1 holds 
(vi, Uf-fi) € £; I is called the length of the path. A shortest path between v and v' is a path such 
that the first vertex is vq = v, the last is V£ = v' and the length of the path is minimum over all 
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paths from v to v'. We refer to the distance between v and v 1 as the length of the shortest path. 
(Note that, in general, this distance is not a proper metric; it is not necessarily symmetric.) The 
diameter D(Q) is defined as the maximum length of a shortest path between any two vertices in Q. 
If there exists a path between all v G V and v' G V, we say that the graph is strongly connected. If 
n > 2, a strongly connected graph is such that for all v G V holds d out {v) > 1 and d m {v) > 1. In 
the sequel, the graph Q is always assumed to be strongly connected. 

To perform an asymptotic analysis on the performance of the algorithms in the paper, we 
consider a family T = {Gi}fZi °f graphs with increasing number of vertices. We say that the family 
of graphs T has a bounded out-degree if 

3d G N such that V£i G F : (f u *(£i) < d . 

Similarly, the family T is said to have a bounded in-degree if 

3d G N such that V& G J" : d in (&) < d . 

Examples of families of bounded degree graphs include: a family of directed symmetric chain 
graphs with d = d = 2; a family of g-ary rooted directed trees with d = q and d = 1; a family of 
directed symmetric toroidal lattices with d = d = 4; and a family of ring graphs with d = d = 2. 

Throughout this paper, we consider the following communication scenario: initially, each vertex 
has only the knowledge of its successors and, possibly, its predecessors (the global structure of the 
graph is unknown). The representation each vertex has of its neighbors is relative, meaning that 
there is no global indexing of nodes in the graph0 Also, m distinct bitstrings are initially dispersed 
throughout the network (each bitstring is initially held by one vertex, different bitstrings may be 
held by different initial vertices). The bitstrings are assumed to be drawn from a very large set, 
such as K*, where K is a finite field (not necessarily binary) and t is an integer. 

The goal of the covering problem is to deliver all m bitstrings to all vertices in the graph. 
The transmission can be carried out either in synchronous or in asynchronous mode. In either 
setting, at most one bitstring is transmitted at a given time from a vertex to one of its successors. 
Typically, at each point of time, a vertex can transmit to its successors at most one bitstring. 

In the synchronous model, the network is provided with an universal clock, such that all the 
transmissions proceed in synchronized rounds, and all vertices can transmit at each round. 

In the asynchronous model, each vertex is equipped with a clock that ticks according to a rate- 
1 Poisson point process. The clocks at different nodes are assumed to be independent of each 
other. When the clock ticks at a given vertex, that vertex can transmit (although it may also 
decide not to do so). Via the additive property of Poisson processes, the collection of clock 
ticks across the network can be viewed as being generated by a single rate-n Poisson process, 
where each clock tick is assigned uniformly and independently to one of the vertices. Hence, 
the next clock tick is always uniform and i.i.d. over the vertices in the network. 

In both synchronous and asynchronous setups, we consider a call model; i.e., where the trans- 
mitting vertex calls a communicating partner chosen from all its successors. As it was observed 
in the gossip algorithm typically consists of two decision layers. The first layer, a basic gossip 
algorithm, is related to the decision made by a vertex about which of its successors to contact 
when its clock ticks. This decision can be made in either deterministic or randomized manner. For 
example, vertex can select one of its successors, at random using uniform distribution. 

1 This setup is referred to as anonymous networking in the literature, see, e.g., pQ. 
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The second layer, a gossip-based protocol, is related to what content is transmitted to the 
successor and how the internal status of the transmitter and receiver is updated. This content can 
be just one of the bitstrings the transmitting vertex already has in its possession, or it can be a 
linear combination of some bitstrings, akin the network coding (see [5]). 

We consider three problems that are specific cases of what is usually called gossip covering or 
information spreading problems. Randomized gossip algorithms are usually employed to solve these 
problems. The three problems are the following. 

1. Expected covering. In expected covering, the objective is that each vertex in Q receives all 
bitstrings at least once in expectation after an infinite number of clock ticks. 

2. Almost sure covering. In almost sure covering, the objective is that, after an infinite 
number of clock ticks, each vertex in Q has received all the bitstrings with probability one. 

3. Sure covering. In sure covering, the objective is to guarantee (i.e., for every sample paths) 
that each vertex in Q receives all the bitstrings after a finite number of transmissions. 

There exists a hierarchy between those three problems as stated in the following theorem, which 
is proved in Appendix lAl 

Theorem 1. Sure covering problem is strictly more difficult than almost sure covering problem 
that is itself strictly more difficult than expected covering problem. 

By slightly abusing the language, we sometimes say the function /(•) is a bound for the covering 
problem, if /(•) is a bound for all three covering problems. 

We define two quantities characterizing the gossip algorithms: the delay and the number of 
transmissions. 

Definition 1. In a synchronous gossip algorithm: the delay is the total number of transmission 
rounds. 

In an asynchronous gossip algorithm: the delay is the total number of times any vertex is 
allowed to transmit (i.e., the number of global clock ticks). 

Definition 2. The number of transmissions in a (synchronous or asynchronous) gossip algorithm 
is the total number of actual transmissions used in the algorithm. 

The number of transmissions in the asynchronous time setting is always smaller or equal to 
the delay. It is also smaller or equal to n times the delay of the same algorithm in synchronous 
time setting. The delay characterizes the time required to solve one of the covering problems. The 
number of transmissions is a different measure, which is significant if the main objective is to save 
non-necessary transmissions (for instance, the power consumption, given that each transmission 
consumes a constant amount of energy). Note that these quantities tend to infinity as n grows. In 
that case we are interested in comparing the asymptotic behavior of ratios of these quantities for 
different algorithms. 

3 Algorithms 

We consider three different algorithms: for, nca and fwr. These algorithms are executed at each 
vertex in the process of bitstring transmission and receiving. We denote by S(v), V(v) and C(v), 
respectively, the set of successors, the set of predecessors, and the set of bitstrings known to a 
vertex v at some time during execution of the algorithm. 
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3.1 Classical forwarding (for) 

The for algorithm is straightforward. Each vertex v chooses one of its successors randomly and 
uniformly from S(v), and also chooses one bitstring randomly and uniformly from C(y), and sends 
the chosen bitstring to the chosen successor. If it owns no bitstring, then it performs no trans- 
mission. Note that in some literature this algorithm is referred to as the push version of for. A 
pseudo code version of for is presented in Algorithm [TJ 

Sending a bitstring: 
begin 

if C(v) j£ then 

select v' at random in S(v); 
select c at random in C(v); 
send c to v'\ 
end 
end 

Receiving a bitstring c: 
begin 

I C(v) <- C(v) U {c}; 
end 

Algorithm 1: Classical forwarding (for) algorithm for vertex v. 



3.2 The network coding approach (nca) 

In the network coding approach, the bitstrings are assumed to be vectors over the finite field K. In 
each transmission step, the vertex sends a random linear combination of the bitstrings it already 
has in its possession. The coefficients of the vectors in this linear combination are selected randomly 
and uniformly from K. Denote by cx,C2, • • • , c m the original bitstrings. 

In this approach, the receiving vertex has to be able to recover the original bitstrings from the 
received linear combinations. To do so, it must obtain m linearly independent vectors. Moreover, 
the coefficients of the encoding must also be known to the receiver. Thus, the transmitted message 
contains both a header with the coefficients and the information payload. In other words, the 
message is a pair (c, w) where w = {w\, w 2 , • • • ,w m ) is a vector of m elements in K, and c = 

The receiving vertex stores in its memory the list of pairs received so far, 

n{v) = { (ci,wi) y. =1 , 

where r is the current size of the list, and Wi = (uii t i,Wi t 2, • • • ,tt>i,m)- For convenience, define the 
r x m matrix W = («>ij)<=i, v . , r; j=1>2 ,... , m - 

As the size of 7Z(v) grows, the vertex may try to solve the corresponding system of linear equa- 
tions in order to retrieve the original bitstrings. Pseudo code for nca is presented in Algorithm [2j 

Lemma [T] gives a relation for the delays and number of transmissions between nca and for: 

Lemma 1. For any graph, the delay and the number of transmissions required by nca are at 
least that required by for for the same objective (expected or almost sure covering) with a single 
bitstring. 
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Sending a bitstring: 
begin 

draw a vector a = {a\, c*2, • • • , a r ) uniformly at random in K7'; 
select a successor v' at random in S(v); 
send (c = Yll=i a i£i> w = atW) to u'; 
end 

Receiving a bitstring (c, w): 
begin 

add (c, w) to 7^(i>); 

r <— r + 1 ; 

check for a full-rank m x m submatrix Z of W ; 
if Z is invertible then 

| inverse Z and find the initial bitstrings; 
end 
end 

Algorithm 2: Network coding approach (nca) algorithm for node v. 

Proof. Let us consider nca on a graph with m bitstrings. In this case, typically the vertex sends 
out a linear combination of the received messages. It is straightforward to see that the propagation 
of messages in nca is analogous to the propagation of an initial bitstring in for. 

Since it is clear that a vertex that receives no message cannot retrieve the initial bitstrings, the 
delay and the number of transmissions needed in nca is greater than or equal to that of for. □ 

Note that in both for and nca the selection of the receiving vertex is fully random. Therefore, 
these algorithms cannot solve the sure covering problem for most of non-trivial graphs. 

3.3 Forwarding without repeating (fwr) 

When taking a closer look at for, it appears that time and transmissions are wasted when a vertex 
receives a message it already has. This situation, in particular, happens when the same predecessor 
sends the same message to the same successor twice. As we show below, a simple amendment 
to for, which prevents repeated transmissions of the same bitstring, can lead to a meaningful 
improvement. 

In the fwr algorithm, each vertex stores in its memory the list of pairs C(v) C C(v) x S(v) of 
bitstrings and successors it has sent messages to, including which specific bitstrings were sent, and 
it also records the predecessors from which it has received messages. By using this data, the vertex 
will not send a bitstring c to a successor v' if the pair (c, v') already appears in C(v). Thus some 
redundant transmissions may be avoided. Note that we slightly abuse the notation by treating 
C(v) as a set. 

Pseudo code version for fwr is given in Algorithm [3l Note that for the sake of simplicity, Algo- 
rithm [3] is not presented in a way consistent with the separation in two distinct layers (basic gossip 
algorithm and gossip-based protocol). However, it is straightforward to change the presentation to 
make it consistent with that model. 

When a vertex receives a bitstring c from predecessor v', it stores the pair (c,v') in the list 
C(v). However, in the analysis in this paper, for the sake of simplicity we assume a slightly weaker 
version of the algorithm fwr. More specifically, we assume that upon receiving a message, the 
vertex performs no update of the list C{v). 
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Sending a bitstring: 
begin 

if (C(v) x S(v)) \C(v) ^ then 

select a pair (c, v') uniformly at random in (C(v) x S(v)) \C(v); 
send the bitstring c to v'; 
add (c, u') to C(v); 



end 

Receiving a bitstring c from i/: 
begin 

| add (c, i/) to 
end 

Algorithm 3: Forwarding without repeating (fwr) algorithm for node v. 

Since fwr can be viewed as an improved version of for, where some redundant transmissions 
are avoided, it is immediate that the delay and the number of transmissions required using fwr is 
less or equal to those required when using for. 

In the following sections, we derive some bounds for covering problems for both synchronous 
and asynchronous settings. 

4 Synchronous time setting 

First, we derive a lower bound on the covering delay for any algorithm and any m under the 
synchronous update model. Assume that only one bitstring is used in the network. The number of 
vertices that have obtained the bitstring cannot grow by more than a factor of two during one step 
of the algorithm, which corresponds to the case where all vertices having the bitstring send it to 
distinct vertices that do not have it yet. It follows that the delay for solving the covering problem 
using any algorithm on Q, T>(Q), is at least: 



This lower bound is not tight for all graphs. An alternative lower bound depends on the bitstring 
diameter of a graph. The bitstring diameter D C {Q) of the graph is the maximum length of a shortest 
path between a vertex that initially has a bitstring and any other vertex. 

Theorem 2. In the synchronous time setting, a lower bound for covering delay is: 



The proof of Theorem [2] appears in Appendix iBl 

By using fwr, it is immediate that in at most m ■ d out (v) steps (corresponding to the case where 
v has all m bitstrings), all the successors of v will obtain the bitstrings known to v. By using this 
principle recursively, one can derive the following upper bound on the covering delay: 

Lemma 2. When using fwr on a graph Q in synchronous settings, the covering delay is at most: 



end 




(1) 



V(G) > D C {G) . 



V(Q) <m-d 



(2) 
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It follows from (JTJ) and Theorem [2] that the delay in fwr is at a constant factor m ■ d out {Q) 
from the optimal number of steps, for any family of graph with bounded out-degree and bounded 
number of initial bitstrings. 

For specific graphs we can provide tighter bound on the sure covering time, as we state in the 
following theorem, which is proved in Appendix [Cl 

Theorem 3. Let Q be a directed tree with root vo, and assume that vq has the bitstring c. Consider 
synchronous gossip algorithm fwr applied to this graph. Then, the number of transmission rounds, 
M(G), needed for sure covering of Q is upper bounded by 

M{G) < max J ^d out {v) 

where <3? runs over all paths from vq to the leaves of G, and where d out (v) denotes the out-degree 
of v in the tree Q. 

5 Asynchronous time setting 

Similar to the scenario of the synchronous time setting, we can derive a trivial lower bound on the 
covering number of transmissions in asynchronous mode on graph Q. Moreover, since the number 
of transmissions is smaller than the delay, this bound also holds for the covering delay. This lower 
bound is 

M{G) > n - 1 , (3) 

and is a direct consequence of the fact at most one new vertex receives the bitstring at each 
asynchronous step (and n — 1 vertices initially do not have one of the bitstrings). 

As far as the number of transmissions is concerned, we can derive some upper bounds for fwr. 
Those are presented in Theorem [3] (which is proved in Appendix [D]) and Corollary [TJ 

Theorem 4. The number of transmissions required to solve the covering problem using fwr on G 
is 

Af(G) < m\£\ . 

Corollary 1. As a direct consequence of Theorem 01 we have: 

M{G) < mn ■ d out {G) ■ 




It follows from ([3|) and Corollary Q] that fwr is asymptotically optimal (up to a multiplicative 
constant) in the number of required transmissions when applied to a family of graphs with bounded 
out degree and with a bounded number of initial bitstrings. 

Let us now focus on estimating bounds for the number of transmissions required when using 
for for any family of graph with bounded in-degree and bounded number of initial bitstrings. (See 
Appendix [E] for an analysis tailored specifically to the family of chain graphs.) 
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Theorem 5. Assume that for is used in asynchronous time setting on the graph Q. Then, there 
exists vertex v € V, such that the probability that v received all bitstrings after at most s trans- 
missions, Pg(v), satisfies: 



Proof. First, note that the expression does not depend on the number of initial bitstrings. It is 
obvious that a larger number of bitstrings requires more transmissions using f or and thus we focus 
on the simple case where m = 1. Let c be the corresponding bitstring. 

It will be convenient to say that a vertex of a graph is contaminated if it has obtained the 
bitstring c. After s transmissions, the set of contaminated vertices in the graph Q is denoted C s . 
Assume that Q contains at least 2 vertices, and take some k € N such that n > k > 1. Run for until 
the point when \C S \ = k for the first time, and let s be the corresponding number of transmissions (s 
is well defined since the size of C s increases by at most one after each transmission) . The probability 
that the next vertex to receive the bitstring at the iteration s + 1 is a given vertex v can be upper 
bounded. As a matter of fact, the algorithm will use any edge at random among those starting at 
vertices in C s . We note that, since \C S \ = k and since the graph is strongly connected, the number 
of such edges is at least k (recall that for each v G C s , we have d out (v) > 1). At most d m (v) of 
these edges are connected to a given vertex v ^ C s . Note that in contrast to the vertices that are 
drawn uniformly in the asynchronous time setting version of for, the probability to use one edge 
is not uniform if the degrees of vertices are not equal. However, the ratio between the probability 
to choose the most probable edge and the less probable edge is at most d OMt (Q). Therefore, the 
conditional probability that the vertex v will receive the bitstring at the transmission s + 1 is 

P.s+l(v) < 



edges connecting a vertex in C s to v: 


■ d out {Q) 




edges starting at vertices in C s \ 





d out {g) ■ d in {v) 
- k ' 

(This probability is conditioned on the assumption that \C S \ = k, and that v £ C s , as we mentioned 
above.) 

Now, run for until the point when \C S \ > n/2 for the first time, and let so be the corresponding 
number of transmissions. Select a vertex v that was not contaminated in the course of the first sq 
transmissions (which is well defined since n > 2). The conditional probability (conditioned on the 
assumptions |C So | > n/2 and v £ C s ) that v is contaminated after so + 1 transmissions is therefore: 

2.^(g).^ W 
n 

Since all transmissions are independent of each other, the probability that v is contaminated 
after s = sq + s' transmissions is: 

n>) < 1 _( 1 _»-o~(W-<«» ' 

/ Is' ■ d out (g) ■ d in (v) 

< 1 — exp 

V n 

Since s' < s, this completes the proof. □ 
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for 


nca 


f wr 


Expected covering 


u(n) 


uj(n) 


9(n) 


Almost sure covering 


u)(n) 


u(n) 


9(n) 


Sure covering 


N/A 


N/A 


9(n) 



Table 1: Comparison of the number of transmissions for the different algorithms and different 
problems when considering asynchronous time setting and a family of graphs with bounded in- and 
out-degree and bounded number of initial bitstrings. 

From Theorem [5] we arrive at the following conclusion. 

Corollary 2. Suppose that the algorithm for is applied in the asynchronous setting to a family 
of graphs with bounded in- and out-degree. Then the number of transmissions s for almost sure 
covering of the graph satisfies: 

s = uj{n) . (5) 



Proof. Observe that almost sure covering implies that every vertex v £ Gi receives all the m 
messages with probability one. By picking a vertex v and applying analysis as in the proof of 
Theorem [5l we obtain that 

2s • d out {Q) ■ d in (v) 

> oo . 

n 

Denote by T = {Gi}^i the family of graphs of bounded degree. Then, 

3d e N such that Vi, Vu e ft : d in (v) < d* n (ft) < d 

and 

3d G N such that Vi : d ou \Qi) < d , 

and thus we obtain 

d • d • - -»• oo . (6) 

n 

If linin^oo j~ < 1, for some constant b > 0, condition ([6]) does not hold. Therefore, we conclude 
that s grows faster than bn for any b > 0, i.e. s = uj(n). 

□ 

Using Corollaries HJ [2] and Lemma [U we conclude that the ratio of the number of transmissions 
needed to assure almost sure covering using f wr to that using for or that using nca tends to zero 
when considering a family of graphs with bounded in- and out-degree and bounded number of 
bitstrings. Table Q] below summarizes the main results of this extended abstract. 

A Proof of Theorem Q] 

First, we prove the loose inclusions: 

• It is clear that sure covering is more difficult than almost sure covering since if all vertices 
have all bitstrings, they also have all of them with probability one. 
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• Suppose that alg is an algorithm that solves almost sure covering. After using alg on a 
graph, let us focus on a bitstring c 6 K and a vertex v £ V. Denote by Tc(v) the random 
variable that counts the number of times v received the bitstring c during the execution of 
alg for infinite number of steps. The probability that v received c at least one time is: 

oo 

P(T c (v)>l) = Y t P(Tc(v) = k) = l. 

k=l 

On the other hand, the expected number of times v received c is: 

oo 

Ex[T c (u)] = - P(Tc(v) = k) 

fc=l 

oo oo 

= ^P(Tc{v)=k)+52(k-l)-P(Tc(v) = k) 

fc=l k=2 

oo 

> £P(T c (*)=fc) 

fe=i 
= l . 

Thus alg always solves expected covering. 
Next, let us prove the strict part of the theorem statement: 

• Consider a graph that consists of two vertices v and v', and such that £ = {(v,v'), (v',v)}. 
Suppose that v initially has some bitstring c. We use the following algorithm: when v 
can transmit (which happens infinitely often under both the synchronous and asynchronous 
model), it chooses with constant probability < a < 1 to send its bitstring c to v' . This 
algorithm trivially solves almost sure covering as the probability that v' receives the bitstring 
c after s steps is: 

P v ,(s) = 1 - (1 - a) s -> 1 . 

On the other hand, it may happen with probability that at each step, v does not commu- 
nicate with v' and thus this algorithm does not solve the sure covering problem. 

• Using the same graph, let us now consider the following algorithm: the first time v transmits, 
it draws a random variable with probability 1 /2 to be one and probability 1 /2 to be zero. If 
the obtained value is one, then v sends its bitstring to v' exactly two times in a row; if it is zero, 
then it never sends its bitstring to v' . Trivially, the probability that v' receives the bitstring 
is 1/2, which means that this algorithm does not solve the almost sure covering problem. On 
the other hand, the expected number of times v' receives the bitstring is 2 • 1/2 = 1, which 
means that this algorithm solves the expected covering problem. 



B Proof of Theorem [2] 

Let v G V be a vertex that initially has some bitstring c and let v' G V be another vertex, such 
that there exists a shortest path starting at v and ending at v' of length D C {Q). We use the fact 
that if the probability that v' received c is zero, then the expected number of times v' received c is 
also zero. We now proceed by induction: 
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Figure 1: Chain graph where vertex Vi is connected to vertices Vi—i and Uj+i. 

• At the beginning, the only vertex in the graph that has a nonzero probability to have c is v. 

• After step s, only the vertices that are at a distance less than or equal to s from v have a 
nonzero probability of having received c. At step s + 1, the gossip transmitter constraint 
implies that only the successors of vertices that have c may have a nonzero probability to 
receive c. 

C Proof of Theorem [3] 

First, observe that the graph Q is covered if all its leaves have obtained the bitstring c. This is due 
to the fact that every vertex in the tree Q is located on the path from the root vq to some leaf. 

Second, let us estimate the number of transmission rounds required to deliver c to a particular 
leaf v. Let $ be a path from vq to v, where $ = (vo, vi, V2, ■ ■ ■ ,V£ = v). In the worst case scenario, 
Vo will send c to all its other successors before it sends c to v\. Then, in the words case scenario, 
v\ sends c to all its other successors before it sends it to v%, and so on. In total, bitstring c will be 
sent to ^u'g^^K) vertices when it reaches v. 

The final expression is obtained by taking the leaf v which maximizes the required expression. 

D Proof of Theorem |4] 

To obtain this result, we first observe that the quantities (for all v G V) m|5(t>)| — |£(f)l are 
non-increasing with the number of transmissions. More precisely, at each transmission, one of the 
quantities m\S(v)\ — \£(v)\ decreases by one. Therefore, an upper bound for the number of steps 
before all £(u)'s are empty is: 

M{Q) < ^2m\S(v)\ - \C(v)\ 

i-GV 

— 171 1*^^)1 

< m\£\ 

E Analysis of for on the chain graph 

Theorem 6. Solving expected covering problem using for on a family of graphs with bounded 
out-degree in the asynchronous time setting requires u(n) transmissions. 

Proof. Take the symmetric directed chain (di)graph depicted in Figure Q] that consists of n + 1 
vertices. This graph is defined by £ = {(vi, Uj+i)}^o U v i)}i=o • ^ ^ ne ver t ex owns the 

bitstring c, it takes at least n transmissions to deliver this bitstring to v n . 

Assume that the vertex v\ has the only bitstring c, and so m = 1. Note that the number of 
vertices having the bitstring c at the end of step s, which we denote by k s , grows at most by one 
at each transmission. Let sq be the smallest number of transmissions such that k so = \n/2\. 
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Since the set of vertices possessing the bitstring is connected, the set of vertices having the 
bitstring is 

\n/2] 

u 

1=1 

This set might also contain vq, but then we ignore vq f°r the sake of simplicity of the analysis. 

Denote by Ei(s) the event that Vi receives the bitstring c at step s. We also denote by P[Ei(s)] 
the conditional probability of the event Ei(s) given that k SQ = \n/2] as above and s > sq. 

We obtain that, for s > sq and % = [n/2] +1, 

P[Ei(s)} <~\, 

since Vi-\ is the next vertex to transmit with probability l/k SQ , and Vi-i sends the bitstring to v\ 
with probability 1/2. Since k So = [n/2], this implies that 

pm*)] < \ . 

Since the transmissions are independent, the probability that Vi receives the bitstring in the 
first s transmissions given that it did not receive it in the first so transmissions, sq < s, is: 



P 



U Bk(s) 

s'=s +l 



<1- 1 



1 



n 



-so 



Take s = b ■ n + so for some constant b > independent of n. It is obvious that -P[(Js'= S0 +i Ei(s)] 
is an increasing function of s, and therefore, for any s < b • n + sq, we have: 



U Bk(s) 
s'=s +l 



< 1- 1 



n 



bn 



Since the events £'j_i(s), Ei(s) and Ei + \(s) form a Markov chain, we can now upper-bound 
P[E i+ i{s)\. 

1 



Similarly, this yields: 



P[E i+ i(s)} < 



U 

s'=s +l 



bn 



71 



< 



1- 1 



71 



This leads inductively to the formula: 



U E n-M 
s'=s +l 



< 



1-1- 



n 



bn 



n—l—i 



Now, observe that 



1-1 



n 



bn 



n—l—i 



~ e 



-(n/2)e- 
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Assume that the algorithm runs for bn + so transmissions, where b > is a constant. To 
upper bound the expected number of times the bitstring c is sent to v n during the execution of 
the algorithm, we note that the number of times v n -\ transmits c to v n is less than bn. Then, the 
expected number of times that c is sent to v n is: 

< bn ■ e-W 2 )-" 6 . 

This value vanishes exponentially fast for sufficiently large values of n. We conclude that the 
expected number of times that c is sent to v n is very close to zero. Therefore, if the number of 
transmissions in the algorithm is linear in n, the algorithm will not achieve an expected covering 
of the chain graph. □ 
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